Improved accuracy and less fault prediction errors via modified sequential minimal optimization algorithm

The benefits and opportunities offered by cloud computing are among the fastest-growing technologies in the computer industry. Additionally, it addresses the difficulties and issues that make more users more likely to accept and use the technology. The proposed research comprised of machine learning (ML) algorithms is Naïve Bayes (NB), Library Support Vector Machine (LibSVM), Multinomial Logistic Regression (MLR), Sequential Minimal Optimization (SMO), K Nearest Neighbor (KNN), and Random Forest (RF) to compare the classifier gives better results in accuracy and less fault prediction. In this research, the secondary data results (CPU-Mem Mono) give the highest percentage of accuracy and less fault prediction on the NB classifier in terms of 80/20 (77.01%), 70/30 (76.05%), and 5 folds cross-validation (74.88%), and (CPU-Mem Multi) in terms of 80/20 (89.72%), 70/30 (90.28%), and 5 folds cross-validation (92.83%). Furthermore, on (HDD Mono) the SMO classifier gives the highest percentage of accuracy and less fault prediction fault in terms of 80/20 (87.72%), 70/30 (89.41%), and 5 folds cross-validation (88.38%), and (HDD-Multi) in terms of 80/20 (93.64%), 70/30 (90.91%), and 5 folds cross-validation (88.20%). Whereas, primary data results found RF classifier gives the highest percentage of accuracy and less fault prediction in terms of 80/20 (97.14%), 70/30 (96.19%), and 5 folds cross-validation (95.85%) in the primary data results, but the algorithm complexity (0.17 seconds) is not good. In terms of 80/20 (95.71%), 70/30 (95.71%), and 5 folds cross-validation (95.71%), SMO has the second highest accuracy and less fault prediction, but the algorithm complexity is good (0.3 seconds). The difference in accuracy and less fault prediction between RF and SMO is only (.13%), and the difference in time complexity is (14 seconds). We have decided that we will modify SMO. Finally, the Modified Sequential Minimal Optimization (MSMO) Algorithm method has been proposed to get the highest accuracy & less fault prediction errors in terms of 80/20 (96.42%), 70/30 (96.42%), & 5 fold cross validation (96.50%).


Introduction
PaaS. Faults can occur at any of these layers, but software-level recovery procedures are discovered and used.
Ahmed et al. [8] cloud technology is defined as the various software runtimes used on cloud computing platforms such as Hadoop, Dryad, and communication frameworks such as HDFS (Hadoop Distributed File System), Amazon S3, and others. Today, there are numerous cloud service apps available that are primarily used on CC platforms such as Nimbus and Eucalyptus, allowing various enterprises to build clouds to improve resource performance efficiency. The use of cloud computing system technology broadens several parallel processing capabilities.
Kamiri & Mariga [9] suggested, that ML is a subfield of artificial intelligence that deals with the creation of algorithms and procedures that allow a computer to learn and gain intelligence through experience. The research methodology used in machine learning research is critical because it influences the accuracy and dependability of the results. Machine learning models learn from historical data, which can be primary or secondary in nature. As a result, there is a vast knowledge base from which robots can learn and make decisions.
Sarker [10] based on sample input-output pairs, the process of learning a function that translates input to output was introduced. To infer a function, it uses labeled training data and a set of training examples. Supervised learning occurs when specific goals are specified to be achieved from a specific set of inputs, i.e., a task-driven method. Classification (data separation) and regression are the most commonly supervised tasks (fitting data). Supervised learning, for example, is used to predict the class label or sentiment of a piece of text, such as a tweet or a product review.
Butt et al. [11] investigate that ML is the logical evaluation of computations and quantifiable models used by computer systems to perform a particular attempt without the need for explicit instructions, based on models, and acceptance. It falls under the umbrella of computerized reasoning. Machine learning is so important in the cloud that it will be used by all clouds soon.
Sun et al. [12] suggested that ML has recently grown at a breakneck pace, attracting a large number of academics and practitioners. It has emerged as one of the most prominent research areas, with applications in a wide range of industries, including machine translation, speech recognition, image recognition, recommendation systems, and so on.
Kochhar et al. [13] the NB classifier are one of the most useful machine learning algorithms. The NB classifier is based on the Bayes theorem, which requires significant independence (nave) between qualities or features (predictors). Because it requires little work to develop and has no complicated repeating parameter setting or computation, the Naive Bayesian classification model is very useful for very large datasets. Despite its simplicity, the Nave Bayes classifier is one of the most widely used algorithms because it frequently outperforms and outperforms more complicated and refined classification algorithms.
Chang & Lin [14] proposed that LIBSVM is a Support Vector Machines library (SVM). The goal is to make applying SVM to applications as simple as possible for users. LIBSVM has been widely used in machine learning and other fields. LIBSVM has grown to be one of the most widely used SVM programs. LIBSVM provides support for a variety of SVM formulations for classification, regression, and distribution estimation. LIBSVM is widely used in numerous fields.
Mohamad [15] proposed that Proposed based on many independent factors, multinomial logistic regression is used to estimate the probability of multiple possible outcomes for a categorical dependent variable with more than two categories. The MLR model compares various categories using a combination of binary logit models. The multinomial logit model is composed of k-1 binary logit models that assess the influence of predictors on the likelihood of success in that category for k response variable categories. C.R. LI & J. GUO [16] proposed that the SMO limits B to only two multipliers that can be calculated analytically and don't require any extra matrix storage. There are two methods for determining which multipliers to optimize. The first heuristic prioritizes unbound multipliers that are more likely to violate the KKT specifications. The second choice heuristic, after selecting the first Lagrange multiplier, selects the second Lagrange multiplier that maximizes the difference between the two prediction errors. To save training time, the SMO technique is based on a single program multiple data (SPMD) paradigm. It divides the entire dataset into smaller subsets and uses several processors to update the error array of each subset in parallel.
Sen et al. [17] suggested that the K-nearest neighbor saves all available records and predicts the class of new occurrences in probability using similarity measures from the nearest neighbors. Unlike other classification techniques that construct a mapping function or internal model, this classification technique is known as a lazy learning method because it stores the data members in inefficient data structures such as hash tables, reducing the computation cost to check and apply the appropriate distance function between the new observation and all k number of different data points stored and then come to any conclusion about the label of the new data point. The results are generated by applying simple majority support to the K-nearest neighbors of each new data point.
Sarker [10] proposed a Random Forest classifier as a well-known ensemble classification approach used in machine learning and data science in a variety of application fields. This method uses a parallel ensemble, which involves fitting multiple decision tree classifiers to different data sets sub-samples in parallel with the conclusion or final result determined by majority voting or averages. Over-fitting is reduced as a result, and forecast accuracy and control are improved. As a result, the RF learning model with multiple decision trees frequently outperforms a single decision tree model. It employs a combination of bootstrap aggregation (bagging) and random feature selection to generate a series of decision trees with controlled variance.
Khazaei & Rezvani [18] this study proposes a multi-objective VM placement strategy to reduce energy costs and optimize scheduling. They present a modified memetic algorithm and compare its performance to baseline and state-of-the-art methods. The proposed method can lower energy costs, carbon footprints, SLA violations, and total IoT response time.
Bharany et al. [19] this paper critically examines FT techniques in CC systems and discusses the error, fault, and failure taxonomy. Furthermore, the purpose of this paper is to investigate many critical research topics and advanced techniques, such as AI, deep learning, the Internet of Things, and ML, that could be used as an intelligent FT strategy in the cloud environment.
Shahid et al. [20] this study compares the performance of existing load-balancing algorithms such as PSO, RR, ESCE, and throttled load balancing. Using a cloud analyst platform, this study provides a detailed performance evaluation of various load-balancing algorithms. Many of the previous papers mentioned in the literature focused on round robin and equally distributed current execution, as well as throttled load-balancing algorithms, and were based on efficiency and response time in virtual machines without taking into account the task-virtual machine relationship or the practical significance of the application. Table 5 in S1 Appendix shows a summary of the literature review in S1 Appendix section.

Problem statement
Reliability is a continuous metric that changes with each computing step. One of the most important service characteristics is reliability, which must be met in cloud computing for a stable operation. The dependability of overall task completion is the result of specific activities, & for too many thousands or millions of computing operations, this can quickly become a fading variety. A cloud system's reliability is an assessment of how effectively the cloud system provides the service to the user based on the criteria listed above [21].
There is a need to design & implement ML models that can resolve low accuracy and high fault prediction error issues by acquiring high accuracy, and less fault prediction error.

The objective of the research
This study aims to help users with QoS in a CC environment by using ML to achieve high accuracy and lower fault prediction error. To accomplish this, the following goals must be met: • There is a need to identify the best ML-based fault prediction model to improve the accuracy and failure prediction.
• Propose an ML-based model to address low accuracy and high fault prediction errors.

Research methodology
This section focuses on research methodology. Classification, research design, data collection procedure, exploratory data analysis, data pre-processing, data analysis techniques, and proposed algorithm have all been thoroughly explained in this section. Research design. The following research design has been followed: Proposed model. In this subsection, we propose our fault classification & prediction model. Fig 1 visualizes the whole research process reported in this paper. We will train our model using secondary & primary datasets and will do fault classification & prediction on the target datasets. By following this approach, we will identify which ML classifiers give the highest accuracy and less fault prediction error in terms of accuracy, prediction, & data validation by classes.
Furthermore, in this section, the data collection & data generated procedure has been explained in detail as well. Fig 1 shows the implementation view of the research framework. (Fig 1) demonstrates that a genuine, competent, and effective solution has been designed to achieve more accuracy and less fault prediction error in cloud computing from the secondary and primary datasets.
Classification. Classification will be used to identify the fault status (True/False and Repair/ Failure) with greater accuracy and lower fault prediction error. Classification is a supervised learning approach in machine learning that refers to a predictive modeling problem that predicts a class label for a given sample. It converts an input variable (X) to an output variable (f) as a goal, label, or category (Y). For example, spam detection by email service providers, such as "spam" and "not spam," could pose a classification challenge [11]. Various algorithms will be used to determine the best result among four directories of the secondary dataset for classification. For the primary dataset, we will use one CSV file to classify various algorithms to find the best algorithm for modification.
Acquired secondary data. We acquired (Antarex HPC Fault Dataset) secondary data through the ZENODO website and this dataset is published in articles. This dataset and all test environment details are publicly available for use by the community. The Antarex secondary dataset is based on trace data from the homonymous experimental HPC system at ETH Zurich during fault injection, which is used to undertake ML-based fault prediction studies for researchers [22]. Tables 6-9 in S1 Appendix shows the details of the CPU-Mem Mono (Instances 4005), CPU-Mem Multi (Instances 4380), HDD Mono (Instances 3244), and HDD Multicores (Instances 2493) dataset in S1 Appendix section. This dataset block has 8 attributes (timestamp, type, args, seqNum, duration, cores, error, and isFault) and various instances. These instance types are numeric and nominal bases [22].
Exploratory data analysis on secondary dataset. We performed Exploratory Data Analysis (EDA) on Antarex secondary dataset. The purpose of EDA is essential to tackle specific tasks such as spotting missing and erroneous data, mapping and understanding the underlying structure of your data, and identifying the most important variables in the dataset. The dataset is separated into two sections, one for CPU and memory-related benchmark apps and fault programs, and another for hard drive-related applications and fault programs. Antarex datasets have four folders, one for each dataset block, namely CPU/Memory and HDD, in singlecore and multi-core forms [22].
Data pre-processing on the secondary dataset. Data pre-processing is necessary before applying ML algorithms to secondary datasets. This dataset has duplicate values in 3 attributes named args, seqNum, and duration. Furthermore, this dataset has some none values and empty rows. All duplicate values, none values, and empty rows are removed using the Remove Duplicates option in excel. Tables 10-13 in S1 Appendix shows the details after applying data pre-processing of the CPU-Mem Mono (Instances 1740), CPU-Mem Multi (Instances 1408), HDD Mono (Instances 568), and HDD Multicores (Instances 551) in S1 Appendix section.
Generated primary data. We have generated a primary dataset through the Weibull distribution approach. The Weibull distribution is also often employed as a time-to-failure model for reliability. It extends the exponential model by including non-constant failure rate functions. This contains both rising and falling failure rate curves and has been successfully utilized to explain both initial burnings and wear-out failures [4]. We have coded different parameters in the java platform for primary data generated using the Weibull distribution approach. Table 1 is a summary of the parameters of the primary dataset generation. The primary dataset is shown in Table 2. Table 2

Data analysis techniques
Different ML-based techniques have been used in this study for fault classification and prediction. Fault classification and prediction are carried out using various classifiers from NB, LibSVM, MLR, SMO, KNN, and RF algorithms.
Naïve bayes. The NB classifier represents, employs, and learns probabilistic knowledge with well-defined semantics. The method is intended for supervised induction tasks where the performance goal is to correctly predict the class of test cases and the training examples include class information. A naive classifier is a type of Bayesian network that is built on two basic simplifying assumptions. It assumes, in particular, that the predictive qualities are conditionally independent of the class and that no hidden or latent features influence the prediction process. As a result, the graphic shape of a naive Bayesian classifier is shown in (Fig 2), with all arcs pointing from the class attribute to the observable, predictive attributes [23].
The Bayes' rule is used in Eqs 1 to 3 to compute the probability of each class given a vector of observed values for the predictive qualities and then predicts the most likely class.
Let C be the random variable representing the class of an instance, & X be a vector of random variables representing the observed attribute values. Furthermore, let c denote a specific class label & x denote an observed attribute value vector.
For continuous attributes, we can write the probability density function for a normal (or Gaussian) distribution.
Library support vector machine. LIBSVM is a library for SVMs. The goal is to make it as simple as possible for users to apply SVM to their applications. LIBSVM has been widely adopted in ML and a variety of other fields. LIBSVM is frequently used in two steps: training a data set to generate a model, followed by using the model to predict information from a testing data set. LIBSVM supports numerous SVM formulations for classification, regression, and distribution estimation. (Fig 3) depicts the code organization of LIBSVM for training [14]. In Eq 4 where e = [1,. . ., 1]T is the vector of all ones, Q is an l by l positive semidefinite matrix, Qij � yiyjK(xi, xj), and the kernel function is as follows: Multinomial logistic regression. Softmax is an abbreviation for MLR. Because of the hypothesis function it employs, regression is a supervised learning technique that can be used to solve a variety of problems, including text categorization. It is a regression model that applies logistic regression to classification problems with multiple possible outcomes [24]. The Multinomial Logistic Classifier is depicted in (Fig 4).
In Eq 5 MLR is employed where the objective function of the classifier is given as above.

Sequential minimal optimization
To train an SVM, a very large quadratic programming (QP) optimization problem must be solved. SMO divides the enormous QP problem into the smallest feasible QP problems. These minor QP issues are handled analytically, which eliminates the need for a time-consuming numerical QP optimization as an inner loop. SMO's memory requirements scale linearly with  training set size, allowing it to handle extremely large training sets. SMO scales the training set size for various test problems somewhere between linear and quadratic because matrix computation is avoided, whereas the traditional chunking SVM technique scales the training set size somewhere between linear and cubic. Because SVM evaluation consumes the majority of SMO's computing time, SMO is the fastest for linear SVMs and sparse data sets. SMO can be more than 1000 times faster than chunking in real-world sparse data collections [26]. (Fig 5) depicts the overall architecture of SMO inference and training. In Eqs 6 to 8 the QP problem for training an SVM is: In Eq 6 the QP problem for training an SVM is maximized and subject to 7 & 8.
K-nearest neighbor. The KNN classification method is widely used. It is widely used because of its simplicity and quick calculation time [28]. The choice of value k is critical in this method, as shown in (Fig 6). The two parameters that must be accessible to different k values are training and validation error rates [29].
• Determine the parameter K defining the number of nearest neighbors [29].
• Calculate the distance between the query and all training examples [29].
• Using the kth minimum, sort the distance and find the closest neighbors [29].
• Use the majority in the category of nearest neighbors as the instance's prediction value [29].
ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi X k To calculate the NNs, the Fine and Medium KNN algorithms employ the Euclidean distance function, as indicated in Eqs 9 and 10.
Random forest. This method generates a large number of collaborative decision trees. In this algorithm, decision trees serve as pillars. RF is a set of decision trees that were defined during the pre-processing stage. After constructing many trees, the best feature from a random subset of features is chosen. Another idea generated by the decision tree algorithm is the creation of a decision tree. As a result, these trees combine to form a random forest, which is used to classify new objects based on the input vector. Each built decision tree is used to categorize. (Fig 7) depicts the flowchart of a random forest classifier [30]. The mathematical formula for random forest classifiers is shown below in Eq 11. n ij ¼ w i C j À w leftj C leftj À w rightj C rightj n i subðjÞ ¼ the importance of node j w subðjÞ ¼ weighted number of samples reaching node j C subðjÞ ¼ the impurity value of node j leftðjÞ ¼ child node from left split on node j rightðjÞ ¼ child node from right split on node j Parameters configuration of ML classifiers. ML classifiers have been configured by applying different parameters to achieve accuracy and fault prediction by class. Table 3 shows the different parameters of ML classifiers with values.

Modified sequential minimal optimization
The original SMO algorithm has low accuracy & a high fault prediction error. This research has to resolve low accuracy & high fault prediction errors by acquiring high accuracy with less fault prediction error from MSMO. The block diagram of an MSMO classifier is shown in (Fig 8). High accuracy & less fault prediction errors are based on the primary dataset that has been generated. High accuracy & less fault prediction error have been evaluated min α1, α2 using an objective function. High accuracy & less fault prediction error have been made by applying objective functions through algorithm parameters & kernel parameters. The C parameter has been determined as a trade-off between fitting the training data & maximizing the separating margin. C has a value between 0.01 & 100. The random seed has been set at 2. The only parameter for the polynomial kernel is the exponent, which has controlled the degree of the polynomial. By default, the kernel has computed the exponent as (x * y).
The mathematical formula for modified sequential minimal optimization classifiers is shown below in Eqs 12-15.
In Eq 12 training, testing & validation is defined as above.

a1:a2 ð13Þ
In Eq 13 the objective function is defined as above.
In Eq 14 polynomial kernel is defined as above.
ðx þ y þ 1Þ^ð15Þ In Eq 15 exponent & lower order are defined as above.

Results and findings
This section includes data analysis and classification results from NB, LibSVM, MLR, SMO, and RF with a confusion matrix, as well as graphical representations of the results. Finally, the MSMO results, which are the main algorithm of this research study, are included here. This study compares conventional ML algorithms to achieve high accuracy and less fault prediction errors. The secondary dataset archive contains four directories, one for each dataset block, namely CPU/Memory and HDD in single-core and multi-core variants [3]. Based on the results, a significant difference has been observed in the four directories of the secondary dataset, with CPU-Mem Multi cores outperforming the remaining directories such as CPU-Mem Mono, HDD Mono, and HDD Multi.
According to the comparisons, the primary dataset outperforms the secondary dataset, so in this study, the primary dataset results were sufficient to consider when modifying the ML algorithm.

PLOS ONE
Improved accuracy and less fault prediction errors via modified sequential minimal optimization algorithm & Primary). The results of NB, LibSVM, MLR, SMO, KNN, and RF are compared in terms of accuracy, fault prediction error, and data validation by class using Eqs 16 to 26. Results from a secondary dataset (CPU-Mem Multi) demonstrated that NB outperformed LibSVM, MLR, SMO, KNN, and RF. Furthermore, the results of the primary dataset demonstrated that RF outperformed, but the time complexity is poor. According to the primary dataset results, RF and SMO have minor point value differences, but SMO has good time complexity. The software environment we used is WEKA 3.8.6 with Remove Percentage Filter. In Eq 16 the accuracy is defined as above.

Recall or TrueÀ Positive Rate
In Eq 17 the recall or true positive rate is defined as above. In Eq 18 the true negative rate is defined as above.
In Eq 19 the precision is defined as above.
In Eq 20 the false positive rate is defined as above.
In Eq 21 the Matthews correlation coefficient is defined as above.
In Eq 22 the F-measure is defined as above.
• The RMSE is a commonly used measure of the difference between predicted & observed values by a model or estimator [31].
• MAE is a distinct measure of two continuous variables [31].
• The relative absolute error normalizes the total absolute error by dividing it by the total absolute error of the simple predictor [32].
• The relative squared error normalizes the total squared error by dividing it by the simple predictor's total squared error [32].
In Eq 23 the RMSE is defined as above.
In Eq 24 the MAE is defined as above.
In Eq 25 the RAE is defined as above.
E i ¼ ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi P n j¼1 ðP ðijÞ À T j Þ In Eq 26 the RSE is defined as above.

Comparison of classification models on a secondary dataset
We are presenting results associated with different classifiers using ISFAULT in secondary data and STATUS in primary data. As classification models, we opted for an NB, LibSVM, MLR, RF, KNN, and SMO with a poly kernel. The secondary data and primary data results of each classifier are shown in (Figs 9-72    The confusion matrix is used to calculate Accuracy, Precision, Recall, and F-Measure. It is used as an efficient technique for the classification of attributes based on qualitative response categories. (Figs 45-50) show the confusion matrix related to accuracy & fault prediction, achieved through NB, LibSVM, MLR, SMO, KNN, and RF. The following confusion matrix indicates that the SMO classification model gives the highest percentage of accuracy & less fault prediction on HDD Mono.

PLOS ONE
Improved accuracy and less fault prediction errors via modified sequential minimal optimization algorithm

Comparison of classification models on a primary dataset
We are presenting results associated with different classifiers using the STATUS class in the primary dataset. For classification models, we opted for NB, LibSVM, MLR, RF, KNN, & SMO with the poly kernel.
The RF classifier gives the highest percentage of accuracy and less fault prediction in terms of 80/20 (97.14%), 70/30 (96.19%), and 5 folds cross-validation (95.85%) in the primary data results, but the algorithm complexity (0.17 seconds) is not good. In terms of 80/20 (95.71%), 70/30 (95.71%), and 5 folds cross-validation (95.71%), SMO has the second highest accuracy and less fault prediction, but the algorithm complexity is good (0.3 seconds). The difference in accuracy and less fault prediction between RF and SMO is only (.13%), and the difference in time complexity is (14 seconds).  SMO gives the second-highest accuracy and less fault prediction but the algorithm complexity is good (0.3 seconds). The accuracy and less fault prediction difference between RF and    The confusion matrix is used to calculate Accuracy, Precision, Recall, and F-Measure. It is used as an efficient technique for the classification of attributes based on qualitative response categories. (Fig 93) shows the confusion matrix related to accuracy & fault prediction, achieved

PLOS ONE
Improved accuracy and less fault prediction errors via modified sequential minimal optimization algorithm through MSMO. The following confusion matrix indicates that the MSMO classification model gives the highest percentage of accuracy & less fault prediction error on the primary dataset against NB, LibSVM, MLR, SMO, KNN, and RF. (Fig 94) represents the error of the classifier which shows the values corresponding to true positive, true negative, false positive, and false negative values. In (Fig 94) the square box represents the errors in the actual class versus the predicted class.

Discussion
This study was carried out to achieve high accuracy and less fault perdition to achieve reliability. The MSMO classifier was created to ensure the smooth execution of the research.  An MSMO classifier was applied to the primary data. According to the MSMO classifier results, the proposed strategy outperforms the existing classifier in terms of accuracy and fault prediction. The obtained results in the primary data were compared to the existing NB, LibSVM, MLR, SMO, KNN, and RF classifiers. High accuracy with low fault prediction is the most important parameter for judging the classifier's performance level.
Simulated results were compared to NB, LibSVM, MLR, SMO, KNN, and RF classifiers, and it was demonstrated that the proposed classifier performed more accurately and quickly, with 96.5% of instances correctly classified than the available classifiers. The proposed research's innovation is a collection of techniques that have been linked to high accuracy and less fault prediction to achieve reliability. MSMO was proposed using parameter tuning, which is considered a novel approach.

PLOS ONE
Improved accuracy and less fault prediction errors via modified sequential minimal optimization algorithm

Contributions
Our work makes numerous contributions. We began by acquiring the HPC fault dataset and evaluating a fault classification method based on supervised machine learning. This dataset and all test environment details are publicly available for use by the community. The Antarex secondary dataset will be based on trace data from the homonymous experimental HPC system at ETH Zurich during fault injection, which will be used to undertake machine learning-

PLOS ONE
Improved accuracy and less fault prediction errors via modified sequential minimal optimization algorithm based fault prediction studies for researchers. The dataset will be separated into two sections, one for CPU and memory-related benchmark apps and fault programs, and another for hard drive-related applications and fault programs. Antarex dataset will have four folders, one for each dataset block, namely CPU/Memory and HDD, in single-core and multi-core forms [3].
Second, we generated a primary dataset through the Weibull distribution approach. The Weibull distribution is also often employed as a time-to-failure model for reliability. It extends the exponential model by including non-constant failure rate functions. This contains both rising and falling failure rate curves and has been successfully utilized to explain both initial burnings and wear-out failures [4]. We have coded different parameters in the java platform for primary data generated using the Weibull distribution approach. Third, our analysis is based on (Antarex Secondary Datasets) obtained from the ZONODO website and (Primary Datasets) generated using the Weibull distribution approach [3,4]. We present the results of our experiments, which were designed to determine which ML algorithms provide better results in terms of high accuracy and less fault prediction.
As a fourth & final contribution, high accuracy, and less fault prediction error, have been achieved using the MSMO classifier which has a good impact on users related to the CC environment.

Conclusions
The results of this study were associated with various classifiers using ISFAULT in secondary data and STATUS in primary data to achieve high accuracy and less fault prediction errors.  In the primary data results, the RF classifier has the highest percentage of accuracy and less fault prediction 80/20 (97.14%), 70/30 (96.19%), and 5 folds cross-validation (95.85%), but the

Contribution towards cloud computing
The SMO classifier has been modified. Our proposed approach improves accuracy and gets fewer fault prediction errors for users in cloud computing environments. It was not an easy task to achieve high accuracy and less fault prediction to reliability to the existing approach. Almost all tuning parameters C parameter, random seed, kernel exponent, and lower order values have been adjusted to get better results in terms of accuracy, mean square error, and better fitness.